Distributed Network Querying with Bounded Approximate Caching
نویسندگان
چکیده
As networks continue to grow in size and complexity, distributed network monitoring and resource querying are becoming increasingly difficult. Our aim is to design, build, and evaluate a scalable infrastructure for answering queries over distributed measurements, at reduced costs (in terms of both network traffic and query latency) while maintaining required precision. In this infrastructure, each network node owns a set of numerical measurements and actively maintains bounds on these values cached at other nodes. We can answer queries approximately, using bounds from nearby caches to avoid contacting the owners directly. We focus on developing efficient and scalable techniques to place, locate, and manage bounded approximate caches across a large network. We have developed two approaches: One uses a recursive partitioning of the network space to place caches in a static, controlled manner, while the other uses a locality-aware distributed hash table to place caches in a dynamic and decentralized manner. In this paper, we focus on the latter approach. Experiments over a large-scale emulated network show that our techniques are very effective in reducing query costs while generating an acceptable amount of background traffic; they are also able to exploit various forms of locality that are naturally present in queries, and adapt to volatility of measurements.
منابع مشابه
Parallel Visual Information Retrieval in VizIR
This paper describes how parallel retrieval is implemented in the content-based visual information retrieval framework VizIR. Generally, two major use cases for parallelisation exist in visual retrieval systems: distributed querying and simultaneous multi-user querying. Distributed querying includes parallel query execution and querying multiple databases. Content-based querying is a two-step p...
متن کاملThe Cuckoo Filter: It’s Better Than Bloom
Approximate set-membership tests, exemplified by Bloom filters [1], have numerous applications in networking and distributed systems. A Bloom filter is a compact data structure to quickly answer if a given item is in a set with some small false positive probability ε . Due to its simplicity and high space efficiency, Bloom filters become widely used in network traffic measurement, packet routin...
متن کاملSemantic Cache Mechanism for Heterogeneous Web Querying
In Web-based searching systems that access distributed information providers, efficient query processing requires an advanced caching mechanism to reduce the query response time. The keyword-based querying is often the only way to retrieve data from Web providers, and therefore standard page-based and tuple-based caching mechanisms turn out to be improper for such a task. In this work, we devel...
متن کاملBounded approximate connes-amenability of dual Banach algebras
We study the notion of bounded approximate Connes-amenability for dual Banach algebras and characterize this type of algebras in terms of approximate diagonals. We show that bounded approximate Connes-amenability of dual Banach algebras forces them to be unital. For a separable dual Banach algebra, we prove that bounded approximate Connes-amenability implies sequential approximat...
متن کاملIvy: Interest- ased Data Delivery in VANET through Neighbor Caching
In this paper, we study the problem of interest-based data delivery in Vehicular Ad Hoc Networks (VANETs), which is to efficiently forward data to a vehicle that owns the given interest without knowing its ID beforehand. Such problem is generally challenging in large-scale distributed networks, because it usually requires to query a huge number of nodes in the network to find a node of interest...
متن کامل